A Holistic Paradigm for Schema Matching∗

نویسندگان

  • Bin He
  • Kevin Chen-Chuan Chang
چکیده

Schema matching is a critical problem for integrating heterogeneous information sources. Traditionally, the problem of matching multiple schemas has essentially relied on finding pairwise-attribute correspondence. In contrast, we propose a new matching paradigm, holistic schema matching, to holistically match many schemas at the same time and find all the matchings at once. By handling a set of schemas together, we can explore their context information that reflects the semantic correspondences among attributes, which is not available when schemas are matched only in pairs. As the realizations of the holistic paradigm, we developed two alternative approaches recently. This article takes an initial step to unify those two approaches and further contrasts their strength and weakness. Specifically, we develop two alternative methods for realizing holistic schema matching: global evaluation and local evaluation. Global evaluation exhaustively assesses all the possible models, where a model expresses all attribute matchings. In particular, we propose the MGS framework for such global evaluation with the hypothesis of the existence of generative models. On the other hand, local evaluation independently assesses every single matching to incrementally construct the model. In particular, we develop the DCM framework for such local evaluation with the observation that co-occurrence patterns across schemas often reveal the complex relationships of attributes. We apply our approaches on matching Web query interfaces on the deep Web. The result shows the effectiveness of both the MGS and DCM approaches, which together demonstrate the promise of the holistic paradigm for schema matching.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Improved Semantic Schema Matching Approach

Schema matching is a critical step in many applications, such as data warehouse loading, Online Analytical Process (OLAP), Data mining, semantic web [2] and schema integration. This task is defined for finding the semantic correspondences between elements of two schemas. Recently, schema matching has found considerable interest in both research and practice. In this paper, we present a new impr...

متن کامل

A Linear Program for Holistic Matching: Assessment on Schema Matching Benchmark

Schema matching is a key task in several applications such as data integration and ontology engineering. All application fields require the matching of several schemes also known as ”holistic matching”, but the difficulty of the problem spawned much more attention to pairwise schema matching rather than the latter. In this paper, we propose a new approach for holistic matching. We suggest model...

متن کامل

Holistic Schema Matching for Web Query Interface

One significant part of today’s Web is Web databases, which can dynamically provide information in response to user queries. To help users submit queries to and collect query results from different Web databases, the query interface matching problem needs to be addressed. To solve this problem, we propose a new complex schema matching approach, Holistic Schema Matching (HSM). By examining the q...

متن کامل

Schema Matching And Mapping-based Data Integration

We propose a flexible framework called MOMA for mapping-based object Object matching or object consolidation is a crucial task for data integration and 370, COMA A System for Flexible Combination of Schema Matching Approaches. Schema matching and mapping are an important tasks for many applications, such as data integration, data warehousing and e-commerce. First and foremost our approach is ba...

متن کامل

A Survey Of Approaches To Automatic Schema Matching Bibtex

ous business domains for automatic discovery, mediation and invocation of services over the Core or BibTEX. The use of such an Rahm, E., Bernstein, P.A.: A survey of approaches to automatic schema matching. VLDB Journal: Very. BibTeX. @MISC(Rizopoulos_schemamatching, author = (Nikos Rizopoulos), 1126, A survey of approaches to automatic schema matching Rahm, Bernstein. The Linked Data paradigm ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004